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Clinical isolates of staphylococci (Staphylococcus aureus and S. epidermidis) which cause serious infec- 
tions due to their intrinsic resistance to beta-lactamase-stable beta-lactam antibiotics (e.g., methicillin) carry 
the mecA gene. Song et aL. FEBS Lett. 221 : 1 67- 1 71 ( 1 987). This gene encodes a putative cell wall biosynthetic 
enzyme referred to as penicillin binding protein 2a (PBP2a). PBP2a. which binds beta-lactams only atconcen- 
5 trations well above therapeutic efficacy, apparently can functionally substitute for all the staphylococcal PBPs 
and permit growth when the host organism is threatened by beta-lactams. Hartman and Tomasz, J. Bacteriol. 
158:513-516 (1984). Wu et al., Antimicrob. Agents Chemother. 36:533-539 (1992) and Ryffel et al., Gene 
94:137-138(1990). 

The mecA gene is not a normal part of the staphylococcal genome. The organism which donated mecA 

10 to the staphylococci remains unidentified. Despite the uniform presence of mec>\ in methicillin-resistant clinical 
isolates, these isolates vary considerably in their degree of resistance to methicillin. This variation in pheno- 
typic expression within a population has been referred to as heterogenous expression. Matthews and Stewart, 
FEMS Microbiol. Lett. 22:161-166 (1984). Typically, most cells exhibit low-level resistance to methicillin and 
only a minority of the population express high-level resistance, perhaps only one in 10^ cells. Tomasz et al., 

/5 Antimicrob. Agents Chemother 35:124-129 (1991 ). Although expression of methicillin resistance is dependent 
upon the presence of FBP2a, it appears to be somewhat independent of the amount of PBP2a, suggesting 
important roles for other factors. Chambers and Hackbarth, Antimicrob. Agents Chemother 31:1982-1988 
(1987) and Murakami and Tomasz, J. Bacteriol. 171:874-879 (1989). 

Tn55 / insertional mutagenesis of methicillin-resistant S. aureus revealed numerous sites which influence 

20 the level of methicillin resistance but are not linked to mecA and do not perturb the expression of PBP2a. Berg- 
er-Bachi et al., Antimicrob. Agents Chemother 36:1367-1373 (1992); Kornblum et al„ Eur J. Clin. Microbiol. 
5:714-718 (1986); Berger-Bachi et al.. Moi. Gen. Genet. 219:263-269 (1989) and Maidhof et al., J. Bacteriol. 
173:3507-3513 (1991). Those factors described thus far generally depress the MIC of beta-lactam resistant 
strains. Some of the genetic loci which demonstrate such an effect on methicillin resistance were designated 

25 factors essential for methicillin resistance (fern). Berger-Bachi et al., MoL Gen. Genet. 219:263-269 (1989). In 
contrast to mecA, the genes which encode influential factors are probably present in both resistant and sus- 
ceptible strains of S. aureus and S. epidermidis. Information obtained from gene disruption studies of femA 
and femB in S. aureus indicated that in addition to enhanced sensitivity to methicillin, homogeneously methi- 
cillin-resistant S. aureus strains carrying such gene disruptions have a reduced glycine content in the pepti- 

30 doglycan component of their cell walls (Maidhof et al., J. Bactehol. 173:3507-3513 (1991)) and exhibit reduced 
rates of cell wall turnover and autolysis (de Jonge et al.. J. Bacteriol. 173:1105-1110 (1991)). 

Genetic factors, other than mecA, that influence the expression of methicillin resistance in S. epidermidis 
have, until now, not been described at the molecular level. The present invention provides DNA sequences 
encoding the FemA protein of Staphylococcus epidermidis. the FemA protein itself, and vectors a^d microor- 

35 ganisms comprising the femA gene of S. epidermidis. 

The present invention provides DNA sequences encoding the FemA protein of Staphylococcus epidermi- 
dis, including the natural gene sequence designated femA (SEQ ID NO:1). Thus, included in the present in- 
vention is any DNA compound that comprises an isolated DNA sequence encoding SEQ ID N0:2. SEQ ID N0:2 
is as follows: 
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Met Lys Met Lys Phe Thr Asn Leu Thr Ala Lys Glu Phe Ser Asp Phe 
15 10 15 

Thr Asp Arg Met Thr Tyr Ser His Phe Thr Gin Met Glu Gly Asn Tyr 
20 25 30 

Glu Leu Lys Val Ala Glu Gly Thr Glu Ser His Leu Val Gly He Lys 
35 40 45 

Asn Asn Asp Asn Glu Val He Ala Ala Cys Leu Leu Thr Ala Val Pro 
50 55 60 

Val Met Lys He Phe Lys Tyr Phe Tyr Ser Asn Arg Gly Pro Val He 
65 70 75 80 

Asp Tyr Asn Asn Lys Glu Leu Val His Phe Phe Phe Asn Glu Leu Ser 
85 90 95 

Lys Tyr Val Lys Lys Tyr Asn Cys Leu T/r Leu Arg Val Asp Pro Tyr 
100 105 110 

Leu Pro Tyr Gin Tyr Leu Asn His Glu Gly Glu He Thr Gly Asn Ala 
115 120 125 

Gly His Asp Trp He Phe Asp Glu Leu Glu Ser Leu Gly Tyr Lys His 
130 135 140 

Glu Gly Phe His Lys Gly Phe Asp Pro Val Leu Gin He Arg Tyr His 
145 ISO 1S5 . 160 

Ser Val Leu Asn Leu Ala Asn Lys Ser Ala Asn Asp Val Leu Lys Asn 
165 170 175 

Met Asp Gly Leu Arg Lys Arg Asn Thr Lys Lys Val Lys Lys Asn Gly 
180 185 190 

i 

Val Lys Val Arg Phe Leu Ser Glu Glu Glu Leu Pro He Phe Arg Ser 
195 200 205 

Phe Met Glu Asp Thr Ser Glu Thr Lys Asp Phe Ala Asp Arg Glu Asp 
210 215 220 

Ser Phe Tyr Tyr Asn Arg Phe Lys His Tyr Lys Asp Arg Val Leu Val 
225 230 235 240 



45 



50 



55 



EP 0 625 575 A2 



Pro Leu Ala Tyr lie Aan Phe Asp Glu Tyr He Glu 
245 250 



Glu Leu Asn Asn 
255 



Glu Arg Asn Val 
260 



Leu Asn Lys Asp Tyr Asn Lys Ala 
265 



Leu Lys Asp He 
270 



Glu Lys Arg Pro 
275 



Glu Asn Lys Lys Ala His Asn Lys 
280 



Lys Glu Asn Leu 
285 



10 



Glu Gin Gin Leu 
290 



Asp Ala Asn Gin Gin Lys He Asn Glu Ala Lys Asn 
295 300 



Leu Lys Gin Glu 
305 



His Gly Asn Glu Leu Pro lie Ser 
310 315 



Ala Gly Phe Phe 
320 



15 



He He Asn Pro 



Phe Glu Val Val Tyr Tyr Ala Gly 
325 330 



Gly Thr Ser Asn 
335 



Arg Tyr Arg His 
340 



Phe Ala Gly Ser Tyr Ala Val Gin 
345 



Trp Lys Met He 
350 



20 



Asn Tyr Ala He 
355 



Glu His Gly He Asn Arg Tyr Asn 

360 



Phe Tyr Gly He 
365 



25 



Ser Gly Asp Phe 
370 

Lys Lys Gly Tyr 
385 



Ser Glu Asp Ala Glu Asp Ala Gly Val Val Lys Phe 
375 380 



Asp Ala Asp Val He Glu Tyr Val 
390 395 



Gly Asp Phe He 
400 



30 



Lys Pro He Asn Lys Pro Met Tyr Asn He Tyr Arg 
405 410 



Thr Leu Lys Lys 
415 



Leu Lys Lys 



The natural femA sequence is ef|compassed by the present invention as a DNA compound which com- 
35 prises the isolated DNA sequence which is SEQ ID N0:1 . SEQ ID N0:1 is as follows: 



ATG AAG ATG AAG TTT ACG AAT TTG ACA GCT AAA GAA TTT AGT GAG TTT 48 

Met Lys Met Lys Phe Thr Asn Leu Thr Ala Lys Glu Phe Ser Asp Phe 

^ 1 5 10 15 

ACT GAT CGT ATG ACA TAT AGT CAT TTT ACA CAA ATG GAA GGT AAT TAC 9 6 

Thr Asp Arg Met Thr Tyr Ser His Phe Thr Gin Met Glu Gly Asn Tyr 
20 25 30 
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GAA TTA AAG GTT GCT GAA GGT ACC GAG TCA CAT TTA GTT GG A ATT AAA 144 
Glu Leu Lys Val Ala Glu Gly Thr Glu Ser His Leu Val Gly He Lys 
35 40 45 

AAT AAT GAT AAC GAA GTG ATT GCA GCT TGT TTA TTA AC A GCT GTT CCT 192 
Asn Asn Asp Asn Glu Val lie Ala Ala Cys Leu Leu Thr Ala Val Pro 
50 55 60 

GTA ATG AAA ATA TTT AAA TAT TTT TAT TCC AAT CGC GGT CCA GTA ATA 240 
Val Met Lys He Phe Lys Tyr Phe Tyr Ser Asn Arg Gly Pro Val He 
65 70 75 80 

GAT TAT AAT AAT AAA GAG CTT GTA CAT TTT TTC TTT AAT GAA TTG AGT 288 
Asp Tyr Asn Asn Lys Glu Leu Val His Phe Phe Phe Asn Glu Leu Ser 
85 90 95 

AAA TAT GTA AAA AAA TAT AAT TGT TTA TAT TTA AGA GTT GAC CCA TAC 336 
Lys Tyr Val Lys Lys Tyr Asn Cys Leu Tyr Leu Arg Val Asp Pro Tyr 
100 105 110 

CTT CCA TAT CAA TAT TTA AAT CAT GAG GGA GAA ATA ACT GGA AAT GCA 334 
Leu Pro Tyr Gin Tyr Leu Asn His Glu Gly Glu He Thr Gly Asn Ala 
115 120 125 

GGT CAT GAT TGG ATT TTT GAT GAA TTA GAG AGT TTA GGA TAT AAA CAC 43 2 
Gly His Asp Trp He Phe Asp Glu Leu Glu Ser Leu Gly Tyr Lys His 
25 135 140 

GAA GGA TTC CAC AAA GGA TTT GAT CCT GTA TTA CAA ATC CGA TAT CAT 480 
Glu Gly Phe His Lys Gly Phe Asp Pro Val Leu Gin He Arg Tyr His 
145 150 155 160 

TCT GTT CTA AAT TTA GCA AAC AAA AGT GCT AAT GAT GTT TTA AAA AAC 528 
Ser Val Leu Asn Leu Ala Asn Lys Ser Ala Asn Asp Vai Leu Lys Asn 
165 170 175 

ATG GAT GGT TTA AGA AAG CGT AAT ACT AAA AAA GTT AAG AAA AAT GGA 57 6 
Met Asp Gly Leu Arg Lys Arg Asn Thr Lys Lys Val Lys Lys Asn Gly 
180 185 190 

GTT AAA GTC CGC TTT TTA TCT GAA GAA GAG TTA CCT ATA TTT AGG TCA 624 
Val Lys Vai Arg Phe Leu Ser Glu Glu Glu Leu Pro He Phe Arg Ser 
195 200 205 

TTT ATG GAG GAT ACC TCT GAA ACT AAA GAT TTT GCA GAT AGA GAA GAT 672 
Phe Met Glu Asp Thr Ser Glu Thr Lys Asp Phe Ala Asp Arg Glu Asp 
210 215 220 

AGT TTT TAT TAC AAC AGA TTC AAA CAT TAT AAA GAC CGT GTT TTA GTA 720 
Ser Phe Tyr Tyr Asn Arg Phe Lys His Tyr Lys Asp Arg Val Leu Val 
225 230 235 240 

CCA CTA GCC TAT ATT AAC TTT GAT GAG TAT ATA GAG GAA CTA AAT AAT 768 
Pro Leu Ala Tyr He Asn Phe Asp Glu Tyr He Glu Glu Leu Asn Asn 
^ 245 250 255 
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GAA AGA AAT GTG CTT AAT AAA GAT TAT AAT AAA GCT TTA AAA GAC ATT 816 

Giu Arg Asn Val Leu Asn Lys Asp Tyr Asn Lys Ala Leu Lys Asp He 
260 265 270 

GAG AAA CGT CCA GAG AAT AAA AAA GCA CAT AAC AAA AAG GAA AAT TTA 864 

Glu Lys Arg Pro Glu Asn Lys Lys Ala His Asn Lys Lys Glu Asn' Leu 
275 280 285 

GAA CAA CAA CTC GAT GCA AAT CAG CAA AAA ATT AAT GAA GCT AAA AAC 912 

Glu Gin Gin Leu Asp Ala Asn Gin Gin Lys He Asn Glu Ala Lys Asn 
290 295 300 

TTA AAA CAA GAA CAT GGC AAT GAA TTA CCC ATC TCT GCT GGC TTC TTT 9 60 

Leu Lys Gin Glu His Gly Asn Glu Leu Pro He Ser Ala Gly Phe Phe 

305 310 315 320 

ATA ATT AAT CCG TTT GAA GTA GTT TAC TAG GCT GGT GGA ACT TCA AAT 100 3 

He He Asn Pro Phe Glu Val Val Tyr Tyr Ala Gly Gly Thr Ser Asn 
325 330 335 

CGT TAT CGC CAT TTT GCA GGG AGC TAT GCG GTT CAA TGG AAG ATG ATT 1056 

Arg Tyr Arg His Phe Ala Gly Ser Tyr Ala Val Gin Trp Lys Met He 
340 345 350 

AAC TAT GCA ATT GAA CAT GGT ATT AAT CGG TAT AAT TTC TAT GGT ATT 1104 

Asn Tyr Ala He Glu His Gly He Asn Arg Tyr Asn Phe Tyr Gly He 
355 360 365 

AGT GGT GAC TTT ACT GAA GAT GCT GAA GAT GCT GGC GTA GTT AAG TTT 1152 

Ser Gly Asp Phe Ser Glu Asp Ala Glu Asp Ala Gly Val Val Lys Phe 

370 375 330 

AAA AAG GGC TAT GAT GCC GAT GTT ATA GAA TAC GTT GGT GAC TTT ATT 1200 

Lys Lys Gly Tyr Asp Ala Asp Val He Glu Tyr Val Gly Asp Phe He 

385 390 395 400 

AAA CCT ATT AAT AAA CCA ATG TAT AAC ATT TAT AGA ACA CTT AAA AAA 1248 

Lys Pro He Asn Lys Pro Met Tyr Asn He Tyr Arg Thr Leu Lys Lys 
405 410 415 

CTA AAG AAA 1257 
Leu Lys Lys 



The present invention also includes the protein encoded by SEQ ID NO: 1 in purified form. Also included 
are recombinant DNA vectors, including expression vectors, that comprise DNA sequences encoding FemA. 

The restriction site and function maps presented in the accompanying drawings are approximate repre- 
sentations of the recombinant DNA vectors discussed herein. The restriction site information Is not exhaustive; 
therefore, there may be more restriction sites of a given type on the vector than actually shown on the map. 

Figure 1 - A restriction site and function map of plasmid pPSJISO, 

Figure 2 - A restriction site and function map of plasmid pET-11A. 

The instant invention provides the femA gene of Staphylococcus epidermidis and all degenerate sequenc- 
es thereof, the protein encoded by the femA gene (FemA), and vectors and microorganisms comprising genes 
encoding the FemA protein. In the practice of the invention as exemplified herein, the FemA protein comprises 
the amino acid sequence, which is SEQ ID NO 2: 
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Met Lys Met Lys Phe Thr Asn Leu Thr Ala Lys Glu Phe Ser Asp Phe 
1 5 10 15 

Thr Asp Arg Met Thr Tyr Ser His Phe Thr Gin Met Glu Gly Asn Tyr 

20 25 30 

Glu Leu Lys Val Ala Glu Gly Thr Glu Ser His Lau Val Gly He Lys 
35 40 45 

Asn Asn Asp Asn Glu Val lie Ala Ala Cys Leu Leu Thr Ala Val Pro 
50 55 60 

Val Met Lys He Phe Lys Tyr Phe Tyr Ser Asn Arg Gly Pro Val He 
65 70 75 80 

Asp Tyr Asn Asn Lys Glu Leu Val His Phe Phe Phe Asn Glu Leu Ser 
85 90 95 

Lys Tyr Val Lys Lys Tyr Asn Cys Leu Tyr Leu Arg Val Asp Pro Tyr 
100 105 110 

Leu Pro Tyr Gin Tyr Leu Asn His Glu Gly Glu He Thr Gly Asn Ala 
115 120 125 

Gly His Asp Trp He Phe Asp Glu Leu Glu Ser Leu Gly Tyr Lys His 
130 135 140 

Glu Gly Phe His Lys Gly Phe Asp Pro Val Leu Gin He Arg Tyr His 
145 150 155 160 
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Ser Val Leu Asn Leu Ala Asn Lys Ser Ala Asn Asp Val Leu Lys Asn 
165 170 175 

Met Asp Gly Leu Arg Lys Arg Asn Thr Lys Lys Val Lys Lys Asn Gly 
180 185 190 

Val Lys Val Arg Phe Leu Ser Glu Glu Glu Leu Pro lie Phe Arg Ser 
195 200 205 

Phe Met Glu Asp Thr Ser Glu Thr Lys Asp Phe Ala Asp Arg Glu Asp 
210 215 220 

Ser Phe Tyr Tyr Asn Arg Phe Lys His Tyr Lys Asp Arg Val Leu Val 
225 230 235 240 

Pro Leu Ala Tyr lie Asn Phe Asp Glu Tyr He Glu Glu Leu Asn Asn 
245 250 255 

Glu Arg Asn Val Leu Asn Lys Asp Tyr Asn Lys Ala Leu Lys Asp He 
260 265 270 

Glu Lys Arg Pro Glu Asn Lys Lys Ala His Asn Lys Lys Glu Asn Leu 
275 280 285 

Glu Gin Gin Leu Asp Ala Asn Gin Gin Lys He Asn Glu Ala Lys Asn 
290 295 300 

Leu Lys Gin Glu His Gly Asn Glu Leu Pro He Ser Ala Gly Phe Phe 
305 310 315 320 

He lie Asn Pro Phe Glu Val Val Tyr Tyr Ala Gly Gly Thr Ser Asn 

30 325 3 30 335 

Arg Tyr Arg His Phe Ala Gly Ser Tyr Ala Val Gin Trp Lys Met He 
340 345 350 



20 



25 



35 



Asn Tyr Ala He Glu His Gly He Asn Arg Tyr Asn Phe Tyr Gly He 
355 360 365 

Ser Gly Asp Phe Ser Glu Asp Ala Glu Asp Ala Gly Val Val Lys Phe 
370 375 380 

40 Lys Lys Gly Tyr Asp Ala Asp Val He Glu Tyr Val Gly Asp Phe He 

335 390 395 400 

Lys Pro He Asn Lys Pro Met Tyr Asn He Tyr Arg Thr Leu Lys Lys 
405 410 415 



45 



Leu Lys Lys 



The present invention also provides the natural femA gene found in Staphylococcus epidermidis, embod- 
ied in SEQ ID NO: 1: 
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ATG AAG ATG AAG TTT ACG AAT TTG ACA GCT AAA GAA TTT AGT GAC TTT 48 
Met Lys Met Lys Phe Thr Asn Leu Thr Ala Lys Glu Phe Ser Asp Phe 
1 5 10 15 

ACT GAT CGT ATG ACA TAT AGT CAT TTT ACA CAA ATG GAA GGT AAT TAC 96 
Thr Asp Arg Met Thr Tyr Ser His Phe Thr Gin Met Glu Gly Asn Tyr 
20 25 30 

GAA TTA AAG GTT GCT GAA GGT ACC GAG TCA CAT TTA GTT GGA ATT AAA 144 
Glu Leu Lys Val Ala Glu Gly Thr Glu Ser His Leu Val Gly He Lys 
35 40 45 

AAT AAT GAT AAC GAA GTG ATT GCA GCT TGT TTA TTA ACA GCT GTT CCT 192 
Asn Asn Asp Asn Glu Val lie Ala Ala Cys Leu Leu Thr Ala Val Pro 
50 55 60 

GTA ATG AAA ATA TTT AAA TAT TTT TAT TCC AAT CGC GGT CCA GTA ATA 240 
Val Met Lys He Phe Lys Tyr Phe Tyr Ser Asn Arg Gly Pro Val He 
65 70 75 80 

20 GAT TAT AAT AAT AAA GAG CTT GTA CAT TTT TTC TTT AAT GAA TTG AGT 288 

Asp Tyr Asn Asn Lys Glu Leu Val His Phe Phe Phe Asn Glu Leu Ser 
85 90 95 

AAA TAT GTA AAA AAA TAT AAT TGT TTA TAT TTA AGA GTT GAC CCA TAC 3 36 
Lys Tyr Val Lys Lys Tyr Asn Cys Leu Tyr Leu Arg Val Asp Pro Tyr 
100 105 110 

CTT CCA TAT CAA TAT TTA AAT CAT GAG GGA GAA ATA ACT GGA AAT GCA 3 84 
Leu Pro Tyr Gin Tyr Leu Asn His Glu Gly Glu lie Thr Gly Asn Ala 
115 120 125 

GGT CAT GAT TGG ATT TTT GAT GAA TTA GAG AGT TTA GGA TAT AAA CAC 4 32 
Gly His Asp Trp He Phe Asp Glu Leu Glu Ser Leu Gly Tyr Lys His 
130 135 140 

GAA GGA TTC CAC AAA GGA TTT (IaT CCT GTA TTA CAA ATC CGA TAT CAT 4 80- 
Glu Gly Phe His Lys Gly Phe Asp Pro Val Leu Gin He- Arg Tyr His 
145 150 155 160 

TCT GTT CTA AAT TTA GCA AAC AAA AGT GCT AAT GAT GTT TTA AAA AAC 528 
Ser Val Leu Asn Leu Ala Asn Lys Ser Ala Asn Asp Val Leu Lys Asn 
40 165 170 175 

ATG GAT GGT TTA AGA AAG CGT AAT ACT AAA AAA GTT AAG AAA AAT GGA 576 

Met Asp Gly Leu Arg Lys Arg Asn Thr Lys Lys Val Lys Lys Asn Gly 
180 185 190 
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GTT AAA GTC CGC TTT TTA TCT GAA GAA GAG TTA COT ATA TTT AGG TCA 624 
Val Lys Val Arg Phe Leu Ser Glu Glu Glu Leu Pro lie Phe Arg Ser 
195 200 205 

TTT ATG GAG GAT ACC TCT GAA ACT AAA GAT TTT GCA GAT AGA GAA GAT 67 2 
Phe Met Glu Asp Thr Ser Glu Thr Lys Asp Phe Ala Asp Arg Glu Asp 
210 215 220 

AGT TTT TAT TAC AAC AGA TTC AAA CAT TAT AAA GAC CGT GTT TTA GTA 720 
Ser Phe Tyr Tyr Asn Arg Phe Lys His Tyr Lys Asp Arg Val Leu Val 
225 230 235 240 

CCA CTA GCC TAT ATT AAC TTT GAT GAG TAT ATA GAG GAA CTA AAT AAT 7 68 
Pro Leu Ala Tyr He Asn Phe Asp Glu Tyr He Glu Glu Leu Asn Asn 
245 250 255 

GAA AGA AAT GTG CTT AAT AAA GAT TAT AAT AAA GCT TTA AAA GAC ATT 816 
Glu Arg Asn Val Leu Asn Lys Asp Tyr Asn Lys Ala Leu Lys Asp He 

260 265 270 

GAG AAA CGT CCA GAG AAT AAA AAA GCA CAT AAC AAA AAG GAA AAT TTA 864 
Glu Lys Arg Pro Glu Asn Lys Lys Ala His Asn Lys Lys Glu Asn Leu 
275 280 285 

GAA CAA CAA CTC GAT GCA AAT CAG CAA AAA ATT AAT GAA GCT AAA AAC 912 
Glu Gin Gin Leu Asp Ala Asn Gin Gin Lys He Asn Glu Ala Lys Asn 
290 295 300 

TTA AAA CAA GAA CAT GGC AAT GAA TTA CCC ATC TCT GCT GGC TTC TTT 9 60 
Leu Lys Gin Glu His Gly Asn Glu Leu Pro He Ser Ala Gly Phe Phe 
305 310 315 320 

ATA ATT AAT CCG TTT GAA GTA GTT TAC TAC GCT GGT GGA ACT TCA AAT 10 08 
He lie Asn Pro Phe Glu Val Val Tyr Tyr Ala Gly Gly Thr Ser Asn 
325 330 335 

CGT TAT CGC CAT TTT GCA GGG AGC ^^T GCG GTT CAA TGG AAG ATG ATT 1056 
Arg Tyr Arg His Phe Ala Gly Ser Tyr Ala Val Gin Trp Lys Met He 
340 345 350 

AAC TAT GCA ATT GAA CAT GGT ATT AAT CGG TAT AAT TTC TAT GGT ATT 1104 
Asn Tyr Ala He Glu His Gly He Asn Arg Tyr Asn Phe Tyr Gly He 
355 360 365 

AGT GGT GAC TTT AGT GAA GAT GCT GAA GAT GCT GGC GTA GTT AAG TTT 1152 
Ser Gly Asp Phe Ser Glu Asp Ala Glu Asp Ala Gly Val Val Lys Phe 
370 375 380 

AAA AAG GGC TAT GAT GCC GAT GTT ATA GAA TAC GTT GGT GAC TTT ATT 1200 
Lys Lys Gly Tyr Asp Ala Asp Val He Glu Tyr Val Gly Asp Phe He 
385 390 395 400 



AAA CCT ATT AAT AAA CCA ATG TAT AAC 
Lys Pro He Asn Lys Pro Met Tyr Asn 
405 



ATT TAT AGA ACA CTT AAA AAA 1248 
He Tyr Arg Thr Leu Lys Lys 
410 415 



CTA AAG AAA 1257 
Leu Lys Lys 
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The synthesis of the FemA protein of the present invention nnay proceed by solid phase peptide synthesis or 
by recombinant methods. Both methods are described in U.S. Patent No. 4.617,149. the entire teaching of 
which is herein incorporated by reference. Recombinant methods are preferred if a high yield is desired. The 
principles of solid phase chemical synthesis of polypeptides are well known in the art and may be found in gen- 
5 eral texts in the area such as Dugas. H. and Penney, C. Bioorganic Chemistry (1981), Springer- Verlag, New 
York. pgs. 54-92. 

Synthesis of the FemA protein can be achieved by recombinant DNA technology. Synthetic genes, the in 
vitro or in vivo transcription and translation of which will result in the production of the FemA protein may be 
constructed, by techniques well known in the art. Owing to the natural degeneracy of the genetic code, the skil- 

10 led artisan will recognize that a sizable yet definite number of DNA sequences may be constructed which en- 
code the FemA protein. All such genes are provided by the present invention. A preferred gene encoding the 
FemA protein is the natural femA gene of Staphylococcus epidermidis, which is SEQ ID NO: 1. This preferred 
femA gene is available on an - 3.7 kb EcoRI restriction fragment of plasmid pPSJ180, publicly available and 
on deposit in Escherichia coli DH5a at the National Center for Agricultural Utilization Research. 1815 North 

15 University Street, Peoria, Illinois 61604-39999, under accession number NRRL B-21 024 (date of deposit: De- 
cember a, 1992). A restriction site and function map of pPSJ180 is provided in Figure 1 of the drawings. 

The femA gene may be created by synthetic methodology. Such methodology of synthetic-gene construc- 
tion is well known in the art. See Brown et al. (1979) Methods in Enzymology, Academic Press, N.Y., 68:109- 
151. The femA DNA sequence may be generated using a conventional DNA synthesizing apparatus such as 

20 the Applied Biosystems Model 380A or 380B DNA synthesizers (commercially available from Applied Biosys- 
tems, Inc.. 850 Lincoln Center Drive, Foster City, CA 94404). 

To effect the translation of the FemA protein, one inserts the engineered synthetic DNA sequence in any 
of a large number of appropriate recombinant DNA expression vectors through the use of appropriate restriction 
endonudeases and DNA ligase. The synthetic femA gene should be designed to possess restriction endonu- 

25 clease cleavage sites at either end of the transcript to facilitate isolation from and integration into these am- 
plification and expression plasmids. The particular endonudeases employed will be dictated by the restriction 
endonudease deavage pattern of the parent expression vector to be employed. The choice of restriction sites 
are chosen so as to properly orient the FemA coding sequence with control sequences to achieve proper in- 
frame reading and expression of the FemA molecule. The FemA coding sequence must be positioned so as 

30 to be in proper reading frame with the promoter and ribosome binding site of the expression vector, both of 
which are functional in the host cell in which the FemA protein is to be expressed. The FemA protein may be 
expressed in any number of well-known eucaryotic or procaryotic hosts using known promoters and vectors. 
Some of the potential hosts.. in addition to E. coii, include the yeasts Saccharomyces cerevisiae and Pichia 
pastoris. Bacillus, and cells infected with baculovirus. 

35 To achieve efficient transcription of the synthetic gene, said gene must be operably assodated with a pro- 

moter operator region. In one practice of the invention, the promoter-operator region of the synthetic gene en- 
coding SEQ ID NO: 2 is placed in the same sequential orientation with respect to the ATG start codon of the 
synthetic gene as the promoter-operator occupies with respect to the ATG-start codon of the gene from which 
it was derived. Synthetic or modified promoter operator regions have been created and are well known in the 

40 art. When employing such synthetic or modified promoter-operator regions they should be oriented with re- 
spect to the ATG-start codon of the femA gene as directed by their creators. In one practice of the invention 
as exemplified herein, where the host cell is an E. Coli host cell, said promoter-operator region is the phage 
T7 promoter-operator region. 

A variety of expression vectors useful for transforming procaryotic cells are well known in the art. A pre- 

45 ferred vector for expression in an E coli host cell is derived from E coli plasmid pET-11 A, which comprises 
the phage T7 promoter A restriction site and function map of pET- 1 1 A appears in Figure 2 of the accompanying 
drawings. Plasmid pET-11Ais publicly available from Novagen, Inc. (565 Sdence Drive, Madison, Wl 53711) 
under catalog #69436-1. The preferred host strain is E coli BL21(DE3), also available from Novagen under 
catalog #69387-1. 

50 The techniques of transforming cells with the aforementioned vectors are well known in the art and may 

be found in such general references as Sambrook et al.. Molecular Cloning: A Laboratory Manual (1988), Cold 
Spring Harbor Laboratory Press, Cold Spring Harbor, NY or Ausubel et al.. Current Protocols in Molecular Biol- 
ogy {^2^^). John Wiley & Sons, New York, NY and supplements. The techniques involved in the transformation 
of E coli cells used in the preferred practice of the invention as exemplified herein are well known in the art. 

55 The precise conditions under which the transformed E coli cells are cultured is dependent on the nature of 
the E CO// host cell line and the expression or cloning vectors employed. For example, vectors which incorporate 
thermoindudble promoter-operator regions, such as the ct857 thermoinducible lambda-phage promoter-oper- 
ator region, require a temperature shift from about 30 to about 40*^0. in the culture conditions so as to induce 
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protein synthesis. 

In a preferred embodiment of the invention E, coli K12 BL21 {DE3) cells were employed as host cells but 
numerous other cell lines are available. The transformed host cells are then plated on appropriate media under 
the selective pressure of the antibiotic corresponding to the resistance gene present on the expression plasmid. 

5 The cultures are then incubated for a time and temperature appropriate to the host cell line employed. Spe- 
cifically, with E. CO// K12 BL21 (DE3) cells, the femA gene is placed under the control of a promoter transcribed 
specifically by the T7 RNA polymerase. Induction of transcription of the femA gene is accomplished by the 
addition of isopropylthiogalactoside (IPTG) to the growth medium^ which induces expression of the T7 RNA 
polymerase gene under the control of the lacUVS promoter. The T7 RNA polymerase is then available to tran- 

10 scribe the femA gene. 

General techniques of protein purification are well-known to those of ordinary skill in the art. See Creight- 
on,T.E.. Protein Structure: A Practical Approach {^9S9), IRL Press, Oxford, England and Boliag, D.M, and Edel- 
stein. S.J., Protein Methods (1991), Wiley-Liss, New York, NY. Proteins which are expressed in high-level bac- 
terial expression systems characteristically aggregate in granules or inclusion bodies which contain high levels 

15 of the overexpressed protein. Kreuger et al. (1990) in Protein Folding, Gierasch and King, eds., pgs 136-142. 
American Association for the Advancement of Science Publication No. 89-1 8S, Washington, D.C. and Sam- 
brook et al., Molecular Cloning: A Laboratory Manual (^9QB), pp. 17.37-17.41, Cold Spring Harbor Laboratory 
Press, Cold Spring Harbor, NY. The FemA protein sometimes aggregates into inclusion bodies when expressed 
under the control of phage T7 promoter. Such protein aggregates must be solubilized to provide further puri- 

20 fication and isolation of the desired protein product. A variety of techniques using strongly denaturing solutions 
such as guanidinium-HCl and/orweakly denaturing solutions such as dithiothreitol (DTT) are used to solubilize 
the proteins. 

Recombinantly produced proteins may be purified by a variety of techniques well known in the art such 
as ion exchange chromatography, size exclusion chromatography, electrophoresis, differential centrif ugation. 

25 reversed phase high performance liquid chromatography, immunoaff inity chromatography, and the like. Pro- 
tocols for use of these individual techniques or combinations thereof are well known in the art. Gradual removal 
of the denaturing agents (often by dialysis) in a refolding solution allows the denatured protein to assume its 
native conformation. The particular conditions for denaturation and refolding are determined by the particular 
protein expression system and/or protein in question. The S-sulfonates of the peptide molecules are converted 

:3o to the disulfide paired, folded FemA molecules using a combination of high pH and added thiol in substantial 
accordance with the teaching of Frank. B.H. et al.. (1981 ) in Peptides, Synthesis, Structure and Function. Pro- 
ceedings of the Seventh American Peptide Symposium (Rich. D.H. and Gross, E., eds.) pp. 729-738. Pierce 
Chemical Co., Rockford, IL. 

The femA gene may be used in gene disruption studies in Staphylococcus epidermidis. Although it is be- 

35 lieved that the FemA protein is involved in the formation of a pentaglycine bridge in the cell wall of the bacterium, 
gene disruption will allow one to ascertain the precise effect of the loss of the femA gene. Gene disruption 
experiments in Staphylococcus aureus have revealed that a loss of femA results in an -40% reduction in cell 
wall glycine content. A similar result might be anticipated for S. epidermidis. Once determined, this information 
can be used to generate an assay for agents which inhibit the FemA protein, and are therefore useful in com- 

40 bination with antibiotics to treat methicillin-resistant bacteria. 

The FemA protein of SEQ ID NO: 2 may be produced by recombinant methods. Recombinant methods 
are preferred if a high yield is desired. The present invention thus comprises a method for constructing a re- 
combinant host cell capable of expressing SEQ ID NO: 2, said method comprising transforming a host cell with 
a recombinant DNA vector that comprises an isolated DNA sequence of Claim 1. The present invention also 

45 comprises a method for expressing SEQ ID NO: 2 in a recombinant host cell; said method comprising culturing 
said transformed host cell of Claim 5 under conditions suitable for gene expression. 

The following Examples are provided to further illustrate and exemplify, but not limit the scope of, the in- 
vention. 

50 Example 1 

Source of the Staphylococcus epidermidis femA gene 

Isolation of Plasmid pPSJISQ 

55 

Alyophil of E. coli K^2 DH5(x/pPSJ180 can be obtained from the Northern Regional Research Laboratories 
(NRRL), Peoria, Illinois 61604, under the accession number NRRL B-21024 (date of deposit: December 8, 
1992). The pPSJ180 plasmid may be isolated from E. coli K12 DH5(t/pPSJ180 using techniques well-known 
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to those skilled in the art. See Sambrook et al.. Molecular Cloning: A Laboratory Manual (1988). Cold Spring 
Harbor Laboratory Press. Cold Spring Harbor. NY or Ausubel et al., Current Protocols in Molecular Biology 
(1989), John Wiley & Sons, New York. NY and supplements. 

5 isolati on of the Staphylococcus epidermidis femA gene via the polymerase chain reaction 

Isolated plasmid pPSJIBO is used as the template for the polymerase chain reaction at a concentration of 
1 0 ng/reaction. Ventf,™ DNA polymerase (2 units/^il. Catalog #254, New England Biolabs, 32 Tozer Road, Bev- 
erly. MA 01 9 15-9965) is used with standard Ventp™ DNA polymerase buffer (1X= 10 mM KCI, 20 mM Tris-HCI 

10 (pH 8.8 at 25^C), 10 mM (NH4)2S04. 2 mM MgSO^, 0.1% Triton X-100). The PCR primers used are AGATA- 
TAAAGATCTAGATGGGAGTTATGAA (SEQ ID NO: 3) and ATTTCATAATTAGATGGATCCCTTCTTAAAATC 
(SEQ ID NO: 4). The reaction is carried out by 3 cycles of 94°C for 1 5 seconds, 40^C for 15 seconds and 72^C 
for 1 minute followed by 20 cycles of 94^C for 15 seconds, 55°C for 15 seconds and 72*'C for 1 minute. 

The reaction is transferred to a Centricon 100 microconcentrator (Amicon, Inc., 72 Cherry Hill, Beverly. 

15 MA 01915) and washed with 1 ml of water. The microconcentrator is then subjected to centrifugation at 3000 
rpm in a microcentrifuge for 30 minutes. The reaction is then diluted to 200 yi\ (from -50 (.il) with IX Xbal re- 
striction enzyme buffer. To this is added 50 units of Xbal and 50 units of BamHL The DNA is then digested for 
90 minutes at 37''C. The DNA is phenol extracted and ethanol precipitated. 

20 Example 2 

Construction of an Expression Plasmid Containing the femA Gene 

The DNA created in Example 1 is then tigated to the 5,6 kb Xbal-BamHI fragment of pET-11A (available 
25 from Novagen. Inc. (565 Science Drive. Madison. Wl 53711) under catalog #69436-1. This plasmid is then 
transformed into E coli BL21 (DE3) (also available from Novagen. Inc. under catalog #69387-1) using tech- 
niques well known to those of ordinary skill in the art. 

Example 3 

30 

Expression of the FemA Protein 

E. coli BL21 (DE3) transformed with the femA expression plasmid are grown overnight in TY broth (per ^ 
liter 10 g tryptone. 5 g yeast extract and 5 g NaCI) and 100 |.ig/ml ampicillin. The cells are then diluted 1/50 i 
35 into TY broth + ampicillin and grown at 37''C for 60 minutes. Expression is induced by adding ispropylthioga- 
lactoside (IPTG) to 0.4 mM. Samples are taken at 0 and 6 hours and run on a % SDS-polyacrylamide gel using 
techniques described in Sambrook et al., Molecular Cloning: A Laboratory Manual (1988), pp. 18.47-18.59, 
Cold Spring Harbor Laboratory Press. Cold Spring Harbor, NY An induced protein band is visible by staining 
with Coomassie Blue at the predicted size of 49,000 daltons. 

40 
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SEQUENCE LISTING 



GENERAL INFORMATION: 

(i) APPLICANT: ELI LILLY AND COMPANY 

(B) STREET: Lilly Corporate Center 

(C) CITY: Indianapolis 

(D) STATE: Indiana 

(E) COUNTRY: United States of America 

(F) ZIP: 46285 

(ii) TITLE OF INVENTION: FEMA GENE OF STAPHYLOCOCCUS 
EPIDERMIDIS, FEMA PROTEIN, AND VECTORS AND 
MICROORGANISMS COMPRISING THE FEMA GENE 

(iii) NUMBER OF SEQUENCES: 4 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: C. M. Hudson 

(B) STREET: Erl Wood Manor 

(C) CITY: Windlesham 

(D) STATE: Surrey 

(E) COUNTRY: United Kingdom 

(F) ZIP: GU20 6PH 

(v) COMPUTER READABLE FOPvM : 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: MaCinLOSh 

(C) OPERATING SYSTEM: Macintosh 7.0 

(D) SOFTT^ARE: Microsoft Word 5.1 
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(2) INFORMATION FOR SEQ ID N0:1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1257 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

20 

(B) LOCATION: 1..1257 



(xi) SEQUENCE DESCRIPTION: 

ATG AAG ATG AAG TTT ACG AAT TTG 
Met Lys Met Lys Phe Thr Asn Leu 
1 5 



SEQ ID N0:1: 

ACA GCT AAA GAA TTT AGT GAG TTT 48 
Thr Ala Lys Glu Phe Ser Asp Phe 
10 15 



i 

35 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



ACT GAT CGT ATG ACA TAT ACT CAT TTT ACA CAA ATG GAA GGT AAT TAC 96 
Thr Asp Arg Met Thr Tyr Ser His Phe Thr Gin Met Glu Gly Asn Tyr 

20 25 30 

GAA TTA AAG GTT GCT GAA GGT ACC GAG TCA CAT TTA GTT GG A ATT* AAA 144 
Glu Leu Lys Val Ala Glu Gly Thr Glu Ser His Leu Val Gly He Lys 
35 40 45 

AAT AAT GAT AAC GAA GTG ATT GCA GCT TGT TTA TTA ACA GCT GTT CCT 192 
Asn Asn Asp Asn Glu Val He Ala Ala Cys Leu Leu Thr Ala Val Pro 
50 55 60 

GTA ATG AAA ATA TTT AAA TAT TTT TAT TCC AAT CGC GGT CCA GTA ATA 240 
Val Met Lys He Phe Lys Tyr Phe Tyr Ser Asn Arg Gly Pro Val He 
65 70 75 80 

GAT TAT AAT AAT AAA GAG CTT GTA CAT TTT TTC TTT AAT GAA TTG AGT 288 
Asp Tyr Asn Asn Lys Glu Leu Val His Phe Phe Phe Asn Glu Leu Ser 
85 90 95 

AAA TAT GTA AAA AAA TAT AAT TGT TTA TAT TTA AGA GTT GAC CCA TAC 33 6 
Lys Tyr Val Lys Lys Tyr Asn Cys Leu Tyr Leu Arg Val Asp Pro Tyr 
100 105 HO 

CTT CCA TAT CAA TAT TTA AAT CAT GAG GGA GAA ATA ACT GGA AAT GCA 384 
Leu Pro Tyr Gin Tyr Leu Asn His Glu Gly Glu He Thr Gly Asn Ala 
115 120 125 

GGT CAT GAT TGG ATT TTT GAT GAA TTA GAG AGT TTA GGA TAT AAA CAC 432 
Gly His Asp Trp He Phe Asp Glu Leu Glu Ser Leu Gly Tyr Lys His 
130 135 140 

GAA GGA TTC CAC AAA GGA TTT GAT CCT GTA TTA CAA ATC CGA TAT CAT 480 
Glu Gly Phe His Lys Gly Phe Asp Pro Val Leu Gin He Arg Tyr His 
145 150 155 160 

TCT GTT CTA AAT TTA GCA AAC AAA AGT GCT pkv GAT GTT TTA AAA AAC 523 
Ser Val Leu Asn Leu Ala Asn Lys Ser Ala Asn Asp Val Leu Lys Asn 
165 170 175 

ATG GAT GGT TTA AGA AAG CGT AAT ACT AAA AAA GTT AAG AAA AAT GGA 57 6 
Met Asp Gly Leu Arg Lys Arg Asn Thr Lys Lys Val Lys Lys Asn Gly 
180 185 190 

GTT AAA GTC CGC TTT TTA TCT GAA GAA GAG TTA CCT ATA TTT AGG TCA 624 
Val Lys Val Arg Phe Leu Ser Glu Glu Glu Leu Pro He Phe Arg Ser 
195 200 205 

TIT ATG GAG GAT ACC TCT GAA ACT AAA GAT TTT GCA GAT AGA GAA GAT 672 
Phe Met Glu Asp Thr Ser Glu Thr Lys Asp Phe Ala Asp Arg Glu Asp 
210 215 220 

AGT TTT TAT TAC AAC AGA TTC AAA CAT TAT AAA GAC CGT GTT TTA GTA 720 
Ser Phe Tyr Tyr Asn Arg Phe Lys His Tyr Lys Asp Arg Val Leu Val 
225 230 235 240 



55 
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10 



15 



CCA CTA GCC TAT ATT AAC TTT GAT GAG TAT ATA GAG GAA CTA AAT AAT 768 
Pro Leu Ala Tyr lie Asn Phe Asp Glu Tyr He Glu Glu Leu Asn Asn 
245 250 255 

GAA AGA AAT GTG CTT AAT AAA GAT TAT AAT AAA GCT TTA AAA GAC ATT 816 
Glu Arg Asn Val Leu Asn Lys Asp Tyr Asn Lys Ala Leu Lys Asp He 
260 265 270 

GAG AAA CGT CCA GAG AAT AAA AAA GCA CAT AAC AAA AAG GAA AAT TTA 864 
Glu Lys Arg Pro Glu Asn Lys Lys Ala His Asn Lys Lys Glu Asn Leu 
275 280 285 

GAA CAA CAA CTC GAT GCA AAT CAG CAA AAA ATT AAT GAA GCT AAA AAC 912 
Glu Gin Gin. Leu Asp Ala Asn Gin Gin Lys He Asn Glu Ala Lys Asn 
290 295 300 

TTA AAA CAA GAA CAT GGC AAT GAA TTA CCC ATC TCT GCT GGC TTC TTT 960 
Leu Lys Gin Glu His Gly Asn Glu Leu Pro He Ser Ala Gly Phe Phe 
305 310 315 320 

20 ATA ATT AAT CCG TTT GAA GTA GTT TAG TAG GCT GGT GGA ACT TCA AAT 10 08 

He He Asn Pro Phe Glu Val Val Tyr Tyr Ala Gly Gly Thr Ser Asn 
325 330 335 

CGT TAT CGC CAT TTT GCA GGG AGC TAT GCG GTT CAA TGG AAG ATG ATT 10 56 
Arg Tyr Arg His Phe Ala Gly Ser Tyr Ala Val Gin Trp Lys Met He 
340 345 350 

AAC TAT GCA ATT GAA CAT GGT ATT AAT CGG TAT AAT TTC TAT GGT ATT 1104 
Asn Tyr Ala He Glu His Gly He Asn Arg Tyr Asn Phe Tyr Gly He 
355 360 .365 

AGT GGT GAC TTT AGT GAA GAT GCT GAA GAT GCT GGC GTA GTT AAG TTT 1152 
Ser Gly Asp Phe Ser Glu Asp Ala Glu Asp Ala Gly Val Val Lys Phe 
370 375 380 

AAA AAG GGC- TAT GAT GCC GAT GTT ATA GAA TAG GTT GGT GAC TTT ATT 1200 
Lys Lys Gly Tyr Asp Ala Asp Val He Glu Tyr Val Gly Asp Phe He 
385 390 395 400 

AAA CCT ATT AAT AAA CCA ATG TAT AAC ATT TAT AGA ACA CTT AAA AAA 1248 
Lys Pro He Asn Lys Pro Met Tyr Asn He Tyr Arg Thr Leu Lys Lys 
405 410 415 

CTA AAG AAA 1257 
Leu Lys Lys 
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(2) INFORMATION FOR SEQ ID NO: 2: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 419 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



15 



20 



25 



30 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 

Met Lys Met Lys Phe Thr Asn Leu Thr Ala Lys Glu Phe Sex Asp Phe 
15 10 15 

Thr Asp Arg Met Thr Tyr Ser His Phe Thr Gin Met Glu Gly Asn Tyr 
20 25 30 

Glu Leu Lys Val Ala Glu Gly Thr Glu Ser His Leu Val Gly He Lys 
35 40 45 

Asn Asn Asp Asn Glu Val He Ala Ala Cys Leu Leu Thr Ala Val Pro 
50 S5 60 

Val Met Lys He Phe Lys Tyr Phe Tyr Ser Asn Arg Gly Pro Val He 
65 70 75 80 

Asp Tyr Asn Asn Lys Glu Leu Val His Phe Phe Phe Asn Glu Leu Ser 
85 50 95 

Lys Tyr Val Lys Lys Tyr Asn Cys Leu Tyr Leu Arg Val Asp Pro Tyr 
i 100 105 110 

Leu Pro Tyr Gin Tyr Leu Asn His Glu Gly Glu He Thr Gly Asn Ala 
115 120 125 

Gly His Asp Trp He Phe Asp Glu Leu Glu Ser Leu Gly Tyr Lys His 
130 135 140 

Glu Gly Phe His Lys Gly Phe Asp Pro Val Leu Gin He Arg Tyr His 
145 150 155 160 

Ser Val Leu Asn Leu Ala Asn Lys Ser Ala Asn Asp Val Leu Lys Asn 
45 165 170 175 

Met Asp Gly Leu Arg Lys Arg Asn Thr Lys Lys Val Lys Lys Asn Gly 
180 185 190 



35 
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Val Lys Val Arg Phe Leu Ser Glu Glu Glu Leu Pro He Phe Arg Ser 
195 200 205 
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Phe Met Glu Asp Thr Ser Glu Thr Lys Asp Phe Ala Asp Arg Glu Asp 

210 215 220 

Ser Phe Tyr Tyr Asn Arg Phe Lys His Tyr Lys Asp Arg Val Leu Val 

225 230 235 240 



10 



Pro Leu Ala Tyr lie Asn Phe Asp Glu Tyv lie Glu Glu Leu Asn Asn 
245 250 255 

Glu Arg Asn Val Leu Asn Lys Asp Tyr Asn Lys Ala Leu Lys Asp He 
260 265 270 



15 



20 



Glu Lys Arg Pro Glu Asn Lys Lys Ala His Asn Lys Lys Glu Asn Leu 
275 280 285 

Glu Gin Gin Leu Asp Ala Asn Gin Gin Lys He Asn Glu Ala Lys Asn 
290 295 300 

Leu Lys Gin Glu His Gly Asn Glu Leu Pro lie Ser Ala Gly Phe Phe 
305 310 315 320 

He He Asn Pro Phe Glu Val Val Tyr Tyr Ala Gly Gly Thr Ser Asn 
325 330 335 



25 



Arg Tyr Arg His Phe Ala Gly Ser Tyr Ala Val Gin Trp Lys Met He 
340 345 350 



Asn Tyr Ala He Glu His Gly He Asn Arg Tyr Asn Phe Tyr Gly He 
355 360 365 



30 



Ser Gly Asp Phe Ser Glu Asp Ala Glu Asp Ala Gly Val Val Lys Phe 

370 375 380 

Lys Lys Gly Tyr Asp Ala Asp Val He Glu Tyr Val Gly Asp Phe He 

385 390 395 400 



35 



Lys Pro He Asn Lys Pro Met Tyr Asn He Tyr Arg Thr Leu Lys Lys 
405 410 415 



Leu Lys Lys 
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(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:3: 
AGATATAAAG ATCTAGATGG GAGTTATGAA 30 
(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID.N0:4: 
ATTTCATAAT TAGATGGATC CCTTCTTAAA ATC 3 3 

Claims 

1. A DNA compound that comprises an isolated DNA sequence encoding SEQ ID NO: 2. 

2. The DNA compound of Claim 1 which comprises the isolated DNA sequence which Is SEQ ID NO: 1. 

3. A recombinant DNA vector that comprises the isolated DNA sequence of Claim 1 . 

4. A recombinant DNA vector of Claim 3 that further comprises a promoter positioned to drive expression 
of said isolated DNA sequence. 

5. A method for constructing a recombinant host cell capable of expressing SEQ ID NO: 2, said method com- 
prising transforming a host cell with a recombina nt DNA vector that comprises an isolated DNA sequence 
of Claim 1, 

6. A method for expressing SEQ ID NO: 2 in a recombinant host cell; said method comprising culturing said 
transformed host cell of Claim 5 under conditions suitable for gene expression. 

7. A recombinant host cell transformed with a recombinant DNA vector of Claim 3. 
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8. The protein, in purified form, encoded by SEQ ID N0:2. 
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FIG. 1 
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FIG. 2 
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